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DIVISION  OF  MATHEMATICAL  SCIENCES 
Department  of  Statistic 
Southern  Methodist  University 
Dallas,  Texas  75275 


MODELING  SEASONAL  ABMA  PROCESSES 


Jeffrey  D.  Hart  and  H.  L.  Gray 
Southern  Methodist  University 

INTRODUCTION 

Gray,  Kelley,  and  Mclntire  (1978)  have  introduced  a  method,  based 
on  arrays  of  numbers  called  R-  and  S-arrays,  for  identifying  p  and  q  in 
an  ARMA(p,q)  process.  In  addition,  they  have  illustrated  how  the  same 
method  is  useful  in  detecting  nonstationary  factors  in  an  observed  process, 
and  in  suggesting  an  appropriate  transformation  to  stationarlty.  In  the 
present  paper  special  attention  is  given  to  the  problem  of  modeling 
seasonal  ARMA  processes  using  the  S- array  method.  A  general  definition  is 
given  for  a  seasonal  process,  and  the  procedure  for  identifying  and  model¬ 
ing  such  processes  is  discussed  in  detail.  Additionally,  an  Interesting 
theorem  characterizing  the  S-arrays  (based  upon  the  sample  autocorrelation) 
of  seasonal  processes  is  stated  and  a  proof  indicated.  Finally,  a  data 
set  (the  international  airline  data)  which  exhibits  the  properties  of  a 
seasonal  process  is  analyzed  using  the  method  discussed,  and  two  models 
for  the  data  are  proposed. 

I.  Definitions  and  Theorems 

The  following  definitions  and  theorems  provide  the  motivation  for 
the  remainder  of  the  paper. 

Definition  1.  A  stochastic  process  {Ke>,  t  »  0,  ±1,  ±2,  ...  is  said  to 
be  autoregressive  of  order  p  and  moving  average  of  order  q,  or  ARKA(p,q),  if 
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p  q 

xt  *  £  *kXt-k  +  zt  "  l  9k2t-k  *  (i) 

k-l  k-1 

where  the  and  8^  are  constants  and  {2^}-  is  a:  white  neiae  process  with 
finite  variance.  If  we  define  the  operator  B  by  BXfc  ■  then  (1)  nay 

be  written  as 

♦(B) Xt  -  8(B)Zt  (2) 

where  ♦(B)  »  1  -  $^B  -  <fr2B  -  ...  -  ^BP  and 

9(B)  -  1  -  8.B  -  0.B2  *  ...  -  8  . 

I  Z  g 

It  is  well  known  that  {X£}  is  stationary  if  and  only  if  all  of  the  roots 
of  ♦(x)*0  lie  outside  the  unit  circle.  For  our  purposes ,  a  nonstationary 
process  will  be  one  for  which  one  or  more  of  the  roots  of  ♦(x)«0  lie  on  the 
unit  circle. 

Definition  2.  Suppose  {X  }  is  an  AKMA(p,q)  process  with  <fr(B)X„  - 
9(B)Zt-  A  factor  a(B)  of  <fr(B)  will  be  called  a  seasonal  factor  if 

a(B)  »  1— n^B  +  B2  ,  ]<VjJ  <2  or 

a(B)  -  1  +>  B  ; 

i.e.  a  factor  a(B)  is  seasonal  if  the  algebraic  equation  a(x)  ■  0  has 
complex  roots  on  the  unit  circle  or  the  root  -1. 

Definition  3.  An  ABMA(p,q)  process  will  bis  referred  to  as  a  seasonal 
process  if  it  has  one  or  more  seasonal  factors. 

Definition  4.  Let  m  be  an  Integer  and  f  be  a  real  valued  function. 

.  i 

Further,  let  f  »  f  (m) , 
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A  recursion  relationship  for  calculating  S-array  values  and  a 
complete  discussion  of  how  S-arrays  may  be  utilized  in  identifying 
p  and  q  for  an  ARMA(p,q)  process  may  be  found  in  Gray,  Kelley,  and 
Mclntire  (1978) . 

Definition  5.  Denote  by  . ...  X_)  the  autocorrelation  function  of 

■’  —  IS  X  p 

a  stationary  ARMA(p,q)  process  whose  characteristic  equation  has  roots 
X2,  Xp.  Let  r^,  r^,  ...»  rp  be  the  roots  of  the  characteristic 

equation  of  an  ARMA(p,q)  process  with  j  roots  on  the  unit  circle,  which 
for  convenience  are  denoted  by  r^^,  r^,  ...»  Now,  define 

C  *  Pm(rl’r2,'**,rp)  ’  *  u° 

and 

s»  *  Varr"2 . “i-Vi . V’ J  '  1>2 . .  ’ 

Theorem  1.  Suppose  an  ABMA(p+j,q)  process  has  j  roots  of  its  charac¬ 
teristic  equation  on  the  unit  circle  and  that  M  of  these  j  roots  are  of 

highest  multiplicity.  Let  rlfr2...,rM denote  the  M  distinct  roots  on  the 

* 

unit  circle  which  are  of  highest  multiplicity.  Then  p^  satisfies  a  linear 
homogeneous  difference  equation  of  order  M  whose  characteristic  equation  is 

(1-r"1*) (l-r'1*)  ...  (l-r”1*)  ■  0  . 

For  a  proof  of  this  theorem  see  Findley  (1978)  or  Quinn  (1980) .  The 
•importance  of  Theorem  1  lies  in  its  suggestion  of  the  probable  behavior 
of  the  sample  autocorrelation  function  from  aonstationary  ARMA  processes. 

The  next  two  theorems  will  illustrate  how  S-arrays  may  be  used  to  take 
advantage  of  the  information  in  Theorem  1. 

For  the  remainder  of  this  paper  we  shall  define  the  autocorrelation 

*  .  * 
of  a  nonstationary  process  as  oa  and  hence  will  write  s  p^. 


Theorem  2.  Let  £  denote  either  p_  or  (-l)mp  where  p  is  the  auto-' 
m  n  ni  in 

correlation  function  of  an  ABMA(p,q)  process.  Then  the  condition  that 

S„ (f  )  is  constant  (as  a  function  of  m)  for  some  1  <  p  is  both- necessary 
i  m  “ 

and  sufficient  for  nonstationarlty.  Further,  l  *  M  (where  M  is  the  same 
as  that  in  Theorem  1)  and 

M  M 

W  5  cf  “  t1  “kI1af,kBk]  wher* 

pm  "  8lpm-l  "  *•*  *  *H  Pm-H  *  0  for  m  >  0  and 


[(-l)k,  tm  z  (-1>\  . 

The  proof  of  this  theorem  follows  from  the  proof  of  theroem  9  in 
Gray,  Kelley,  and  Mclntire  (1978)  and  from  the  proof  of  the  previous 
theorem  In  this  paper. 

Theorem  3.  Suppose  {X^>  is  an  ABMA(p.q)  process  and  that 
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t  (*t  -  hi 
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Then  {XJ-  is  nonstationary  if  and  only  if  p-lim  S  (P„(k) )  =  c  for  some  m<p. 

N-Hi  B  N 

Moreover,  if  S  (0.  )  is  defined  for  all  k,  then 

m  K 

m 

c  •  (-l)m(l  -  [  S^) ,  where 
k-1 


*•*  ~8mpk-m 


0  . 


pk  “  8lpk-l  ~ 


This  theorem  has  been  proven  by  Findley  in  personal  communications 
to  H.L.  Gray  and  by  Morton  (1980) . 

Theorems  1,  2,  and  3  provide  the  basis  for  modeling  seasonal  data. 
Theorems  1  and  2  Indicate  respectively  the  effect  which  seasonal  factors 
have  on  the  autocorrelation  function,  and  the  way  in  which  seasonal 
factors  are  manifested  in  S-arrays.  Theorem  3  assures  us  (at  least  for 
reasonably  large  sample  sizes)  that  the  constancy  behavior  which  charac¬ 
terizes  the  parametric  S-array  of  a  seasonal  process  will  also  be  apparent 
in  the  S-array  based,  upon  the  sample  autocorrelation. 

II.  Some  General  Remarks 

Before  considering  a  specific  example  of  real  data,  it  will  be 
helpful  to  consider  an  example  which  illustrates  the  consequences  of 
Theorems  1  and  2.  Suppose  the  process  {Xt>  is  given  by 

(1-B)2(1-^2B  +  B2)<KB)Xt  -  8  (B)Zfc 

where  the  zeroes  of  *(x)  are  all  outside  the  unit  circle.  Note 

that  the  equation  (1-x) 2(l-/2x  +  x2)  *  0  has  the  three  distinct  roots  1, 

y(l+i),  and  y(l-i)  which  are  all  on  the  unit  circle.  However,  since 

the  root  1  is  repeated.  Theorem  1  implies  that  Px(m),  the  limiting  auto¬ 
correlation  function  of  {Xt>,  satisfies  the  first  order  difference 
equation 

y  -  y  .  ■  0  . 

■'m  m-1 

If  Wfc  -  (l-B)Xt,  we  can  make  the  further  observation  that  Pw(m),  the 
autocorrelation  function  of  {W£} ,  satisfies  the  third  order 
difference  equation 

7a  -  ca  +  1)7^  +  (A  *  «V2  -  7,-3  - 


0. 


a 


It  Is  also  Important  to  see  that  Theorem  2  implies 

S1(ox(m))  5  0  ,  S1((-l)*ox(«))  5-2, 

and 

S3(pw(m))  5  0  ,  S3((-l)apw(m)  S  -  2(2W2). 

With  this  example  in  mind,  we  are  now  In  a  position  to  outline  the 
general  procedure  for  detecting  and  estimating  the  parameters  of  seasonal 
factors  present  in  a  process  which  is  being  observed.  In  the  following 
it  will  be  assumed  that  a  realization  . . .  ,x^}  bas  been  obtained 

from  an  A8MA.  process  of  the  form  in  (2) ,  where  all  of  the  roots  of  the 
characteristic  equation  $(x)  "  0  lie  on  or  outside  the  unit  circle. 

The  first  step  in  simply  detecting  the  presence  of  seasonal  factors 
(which  should  of  course  follow  an  initial  look  at  a  plot  of  the  data)  is 

A  A 

an  examination  of  the  sample  S-arrays  (i.e.,  the  S-arrays  for  p  asd  (-1)  p  ). 

m  m 

Theorem  2  implies  that  seasonal  factors  are  in  the  process  if  and  only  if 
one  of  the  following  situations  holds: 

(i)  S , (f  )  =  c  for  some  j  >  2 

j  WL  — 

(ii)  S^)  5  -2 

(iii)  S^C-l)®^)  5  0 

If  N  is  reasonably  large.  Theorem  3  assures  us  that  this  constancy  behavior 
in  the  parametric  S- array  should  also  be  evident  in  the  sample  S-array. 

Thus,  in  order  to  detect  seasonal  factors  the  sample  S-array  should  be 
examined  for  the  presence  of  a  near  constant  column.  (The  interpretation 
of  "near  constant"  will  be  elaborated  on  later.)  Upon  detecting  a  nonsta- 
tionarity  in  the  process,  Theorem  1  suggests  clearly  that  the  next  step 
in  identifying  the  full  order  of  the  process  is  to  operate  on  the  data  by 

r 

the  correct  nonstationary  operator.  The  reason  for  this  is  that  the  auto¬ 
correlation  function  of  a  nonstationary  process  may  satisfy  a  difference 
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equation  of  order  less  than  p  (where  p  is  the  order  of  $ (x) ) ;  and  there¬ 
fore  the  constancy  behavior  expected  in  the  p^  column  of  the  sample 
S-array  may  be  completely  obscured  by  the  presence  of  nonstationary 
factors.  If,  however,  the  data  is  transformed  by  the  correct  non¬ 
stationary  operator,,  the  resulting  S-array  may  be  examined  for 


additional  stationary  or  nonstationary  factors.  The  example  to  be 
presented  later  should  make  these  ideas  clear. 

The  question  of  how  to  choose  the  parameters  of  the  nonstationary 

(seasonal  or  nonseasonal  nonstationary)  operator  now  arises.  In  the 

discussion  of  this  problem  it  should  be  understood  that  process  factors 

whose  zeroes  are  not  on  but  only  close  to  the  unit  circle  will  also 

induce  a  near  constant  column  in  the  sample  S-array.  Thus,  when 

referring  to  the  problem  of  estimating  the  parameters  of  a  nonstationary 

operator,  we  leave  open  the  possibility  that  the  appropriate  operator 

is  mathematically  stationary  but  has  zeroes  which  are  close  to  the  unit 

circle.  The  procedure  to  be  suggested  for  choosing  the  parameters  con- 

sists  of  two  stages.  Suppose  that  the  k  column  of  the  sample  S  array 

til 

is  near  constant,  or  (as  sometimes  occurs  in  practice)  that  the  k 
column  and  one  or  more  columns  previous  to  the  kC^  are  nearly  constant. 


The  first  stage,  then,  in  choosing  the  parameters  of  the  nonstationary 
operator  is  the  fitting  of  a  k^  order  autoregressive  model  to  the  data, 
estimating  parameters  by  the  Yule-Walker  method  (see  Box  and  Jenkins 
(1976)).  After  fitting  the  model,  the  roots  of  the  resulting  characteristic 
equation  should  be  examined.  Associated  with  each  complex  conjugate  pair 


of  roots  a^  ±  ib^ 
frequency  ^ 


(which  may  be  indexed  by  ^ j  "  aj  +  >  bj  >  0)  is  the 


.  For  real 


70 


roots,  Xj  “  a^  and  we  have 


(0  ,  a  >  1 


UJ1! 


The  and  |X^  |  can  be  used  in  the  same  way  that  a  spectrum  would  be 
used  to  compare  the  relative  contribution  o£  each  frequency  to  the  over¬ 
all  variation  in  the  process.  The  and  |xj^|  have  an  advantage  over 

the  spectrum,  however,  in  that  the  are  not  "smeared"  as  they  would  be 

in  the  spectrum  and  the  |X^|  can  be  used  to  determine  stationarity  and 
nonatationarity.  Note  that  the  closer  [Xj^j  is  to  1  the  closer  the 
corresponding  factor  is  to  being  nonstationary.  The  here  may  be  thought 
of  as  the  natural  frequencies  of  the  process  rather  than  as  the  harmonics 
in  a  Fourier  series  expansion.  This  initial  fitting  of  a  model  and  the 


subsequent  examination  of  roots  should  be  used  as  an  investigative  proce¬ 
dure  to  simply  identify  the  nature  of  the  nonstationary  (or  nearly 
nonstationary)  factors  in  the  process  being  observed.  The  preliminary 
nature  of  the  initial  fit  indicates  why  Yule- Walker  estimates  were 


suggested  rather  than,  for  example,  MLEs  (under  the  assumption 
2 

Zt  'v  N(0,a  ))  since  the  computing  time  required  to  calculate  MLEs  is 
usually  much  greater  than  chat  needed  for  Yule-Walker  estimates. 

The  second  stage  in  the  parameter  estimation  problem  involves 
choosing  between  a  nonstationary  and  a  stationary  model.  In  part, 
resolving  this  question  requires  us  to  elaborate  somewhat  on  the  meaning 
of  the  phrase  "near  constant  column  in  the  sample  S-array.”  Besides  a 
simple  examination  of  the  kC^  column  (the  "constant"  column  referred  to 
previously) ,  another  method  for  determining  the  degree  of  constancy  is 
to  form  one  (or  both)  of  the  ratios 


n 


1 


R, 


i-i 


0,k 


-1 


H.k-  <“« 


^  _  r /  -k+i* 

E  S.((-l)  P.k+1 

k+1  i»i  k  _ _ 


) 


Z  Sk^-k-J 

i-£-l  k  L 


V1  _k  , 

E  S  ((-1)  k  L$  ) 
i-£-l 


where  is  constant  which  depends  on  the  record  length  and  l  >  2. 

A  rule  of  thumb  which  has  proven  useful  in  practice  is  to  give  serious 
consideration  to  a  nonstationary  model  whenever  |R^  >  .95.  In  order 

to  see  the  rationale  behind  this  rule  of  thumb  recall  that  for  a  polynomial 

2  k 

1  +  axx  +  a2x  +  . . .  +  a^x 


i\i 


k 

R 

i-1 


-1 


r^  |  ,  where  x2,r2 . 


(3) 


are  the  k  zeroes  of  the  polynomial.  It  can  also  be  shown  (see  Woodward 
and  Gray  1979)  that 


c  11ten  v<-»w 


V»-k> 


Sk((-Uk 


"k  » 


where  ^  is  the  Yule-Walker  estimate  for  the  k  coefficient  of  an  auto¬ 
regressive  process  of  order  k.  Because  of  the  assumed  constancy  behavior 
in  the  k*^1  column,  R,  .  is  obviously  another  (and  usually  better) 
estimate  for  the  ktk  coefficient,  and  thus  if  [R^  >  .95  it  can  be 

seen  from  (3)  that  most  of  the  |r^Zj  for  a  fitted  model  will  necessarily 
be  close  to  one.  (It  should  be  remembered  that  we  have  disallowed  roots 
inside  the  unit  circle  in  all  our  discussion.)  R.  .  is  thus  informative 
for  purposes  of  choosing  between  a  nonstationary  and  a  stationary  model. 
Of  course,  once  a  ktb  order  model  has  been  fit  the  |r'Z|  can  be  examined 

<  j 

directly  as  has  already  been  described.  The  use  of  R^ 


though,  is  still 
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helpful  as  a  preliminary  means  of  checking  whether  an  observed  "constancy" 
behavior  is  evidence  of  a  nonstationarity  in  the  process.  Ultimately, 
however,  the  decision  to  fit  a  nonstationary  model  should  also  be  based 
upon  an  understanding  of  the  physical  aspects  of  the  time  series  under 
consideration  and/or  the  desired  nature  of  the  forecast  function. 

Once  the  decision  has  been  made  to  fit  a  nonstationary  model,  a 
method  of  estimating  the  parameters  of  the  model  is  needed.  In  order  to 
outline  one  method,  first  suppose  that  it  has  been  decided  to  treat  the 
factors  associated  with  (where  the  A^'s  are  a  subset  of  the 

roots  obtained  in  the  preliminary  ■  Yule-Walker  fit)  as  nonstationary 
factors.  The  suggested  procedure  for  obtaining  the  seasonal  or  non- 
seasonal  nonstationary  model  is  to  adjust  these  factors  from 

(l-A”1*)  (1-A  ~l  B)  -  1  -  2a i  I A^1 1 2B  +  |  A"1 1 2B2 
2 

to  1  -  ctjB  +  B  where 

2 

<*  -  sgn[tan(2iro>  )]  ■  ■  ■  - -  r1 

^  ^  /tan2(2muj)+l 

2 

Note  that  the  frequency  associated  with  the  adjusted  factor  (l-a^B+B  ) 
is  still  the  natural  frequency  but  that  the  zeroes  of  this  factor  are 
on  the  unit  circle.  This  adjustment  is  analogous  to  what  is  frequently 
done  in  the  special  case  ■  0  when  the  data  is  differenced.  (If  •  0 
or  Aj  is  real  and  the  adjusted  factors  are  1-B  and  1+B  respectively.) 
Since  the  adjusted  factor  depends  only  upon  oi^,  another  possible  method 
for  obtaining  the  nonstationary  factor  is  to  determine  from  a  spectral 
analysis.  A  spectral  analysis  alone  is  not  sufficient  to  determine  that 
a  factor  is  nonstationary ,  but  it  may  be  helpful  in  determining  precisely 


once  it  is  determined  that  the  factor  associated  with  is  nonstationary 
(i,e.,  that  Ix^n). 

In  order  to  identify  the  full  order  of  the  model,  the  data  should  be 
transformed  by  either  the  operator  from  the  initial  Yule-Walker  fit  or  the 
adjusted  operator.  Since  it  has  been  observed  that  the  S-array  values  for 
transformed  data  are  sometimes  sensitive  to  slightly  different  transform¬ 
ing  operators,  it  is  not  always  clear  which  of  the  two  operators  should 
be  used.  For  this  reason,  examining  the  S-arrays  for  both  transformations 
is  often  useful  for  purposes  of  identifying  the  full  order  of  the  model. 

In  practice  it  will  sometimes  happen  that  one  transformation  will  Induce 
a  clear  pattern  of  constants  in  some  column  of  the  S-array,  whereas  the 
other  transformation  will  not.  Since  such  a  pattern  of  constants  indicates 
that  the  sample  autocorrelation  of  the  transformed  data  nearly  satisfies  a 
difference  equation,  examining  the  S-arrays  of  both  transformations  may 
actually  indicate  which  one  of  the  original  transformations  provides  a 
better  fit  to  the  data. 

III.  Modeling  the  International  Airline  Data 

In  order  to  illustrate  the  method  which  has  just  been  discussed  for 
modeling  nonstationary  data,  we  will  obtain  models  for  the  well-known 
international  airline  data  using  this  method.  The  airline  data  (see  Fig.  1) 
was  first  analyzed  by  Box  and  Jenkins  (1970)  and  is  made  up  of  144  monthly 
totals  of  airline  passengers.  As  Sox  and  Jenkins  have  pointed  out,  if 
{Y„}  is  the  airline  series  and  X  ■  InY  ,  then  the  series  {X  }  is  more 
compatible  with  the  linearity  and  homoscedasticity  assumptions  inherent  in 
AKMA  models  than  is  the  series  {?c>.  Their  model  is  thus  for  {Xfc}  as  will 
be  the  models  in  the  present  analysis.  The  Box  and  Jenkins  model  is 


Airline  Data 
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(l-B)(l-B12)Xt  -  a-0.4B)(l-0.6B12)Zt  (4) 

which  is  arrived  at  by  a  consideration  of  the  physical  aspects  of  the 

airline  data.  It  will  be  shown  in  the  present  analysis  how  models  similar 

to  (4)  may  be  obtained  by  allowing  the  data  to  "speak  for  itself." 

Following  the  outline  in  the  previous  section,  the  first  step  in  our 

analysis  is  an  examination  of  the  sample  S-array.  A  portion  of  the  S-array 

for  (-l)°p  is  shown  in  Table  1.  Note  the  strong  degree  of  constancy  in 
m 

the  first  column.  With  t  ■  2  and  ■  8  we  have  |R^  1(  *  .96,  which  is 

evidence  that  a  nonstationary  or  nearly  nonstationary  first  order  factor 

is  in  the  process.  From  our  discussion  earlier,  an  estimate  for  this 

factor  is  (1-.96B).  The  constancy  behavior  seen  in  column  thirteen  of 

the  S-array  may  be  explained  after  the  data  is  transformed  by  (1-.96B). 

The  S-array  (using  for  the  transformed  data  is  seen  in  Table  2. 

Column  twelve  of  this  array  exhibits  a  constancy  behavior  which  seems 

to  be  of  the  type  which  characterizes  stationary  autoregressive  processes 

(see  Cray,  Kelley,  and  Mclntire  1978) .  However,  upon  fitting  a  12C^ 

order  Yule-Walker  model  to  the  transformed  data  and  examining  the  roots 

of  the  resulting  characteristic  equation  (Table  3) ,  it  is  seen  that  the 

observed  constancy  behavior  is  actually  indicative  of  an  operator  which 

12 

is  quite  nearly  the  seasonal  operator  (1-B  ) .  From  Theorem  2,  if  a 

process  is  of  the  form 

(1-B12)*  (B)X  -  9(B) Z,. 

s  t  t 

(where  *g(B)  is  a  stationary  operator),  then  si2^pm^  5  °*  rt  is  now 
clear,  then,  that  the  values  in  column  12  of  the  sample  S-array  are 
estimates  (in  the  sense  of  an  average)  of  some  value  near  zero,  and  the 
observed  constancy  behavior  is  thus  consonant  with  the  theory  which  has 
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_1_ 

£2 

previously  been  discussed.  By  considering  j  Rq  ^1  for  the  transformed 
data  the  initial  confusion  could  have  been  avoided.  IR0,12^ 

.970  (1*2,  a^  *  6) ,  which  clearly  indicates  that  the  constancy  in  the 
12Clx  column  is  evidence  of  a  seasonality  in  the  data. 

12 

Table  3  exhibits  clearly  that  an  operator  quite  similar  to  (1-B  ) 

should  be  included  in  our  model  for  the  airline  data.  The  frequencies 

associated  with  the  roots  of  the  fitted  characteristic  equation  are  quite 

12 

compatible  with  those  of  1  -  x  *  0,  and  the  root  -1.0811  is  the  only 

one  which  might  not  be  considered  sufficiently  close  to  the  unit  circle. 

For  reasons  of  simplicity  and  parsimony,  then,  the  models  in  the  present 

12 

analysis  will  include  the  operator  1-B  .  At  this  point,  if  it  is  not 

desired  to  treat  the  factor  (1-.96B)  as  nonstationary,  a  reasonable 

12 

procedure  would  be  to  fit  a  model  to  the  series  (1-B  )Xt.  The  S-array 

12 

for  the  data  transformed  by  (1-B  )  indicates  that  the  transformed  data 

may  be  adequately  modeled  as  a  13 ^  order,  stationary  autoregressive 
process.  A  reasonable  initial  model  for  the  airline:  data  would  thus  be 

(l-B12)*1(B)Xt  -  Zt  ,  (5) 

where 

^(B)  -  1  -  .535B  -  .272B"  +  .057B3  -  .018B4  -  .087B5 
-  .04 IB6  +  .076B7  -  .038B8  -  .158B9  +  .136B10 
+  .155B11  +  .Z87B12  -  .295BU  . 

An  estimate  of  the  variance  of  Zfc  for  tills  model  is  a2  *'.00127.  If  the 
original  data  is  transformed  by  (1-B) (1-B*2) ,  a  12^  order,  stationary 
autoregressive  process  is  a  satisfactory  model  for  the  transformed  data. 

Another  contending  model  would  thus  be  ’ 

* 

(l-B)(l-B12)^2(B)Xt  -  Zt  (6) 


where 


18 


Table  3 


Roots  (Xj)  of  characteristic  equation  from 


model  fitted 

to  transformed 

data 

h 

frequency 

(X±) 

l*ili 

1.0607 

0 

.9427 

.8747  +  (.5039)1 

.0832 

.9906 

.5012  +  (.8757)1 

.1673 

.9911 

.0165  +  (1.0285)1 

.2474 

.9721 

-.5091  +  (.8763)1 

.3338 

.9867 

-.8912  +  (.5025)1 

.4183 

.9774 

-1.0811 

.5000 

.9250 

frequency  (r^ 


X 

.8660  +  (.50)1 
.50  +  (.8660)1 
+1 

-.50  +  (.8660)1 
-.8660  +  (.50)1 


0 

.0833 

.1667 

.2500 

.3333 


r 

i 

l 

i 

l 

i 

i 


i 


5000 
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}2(B)  -  1  +  . 358B  +  .054B2  +  .151B3  +  .110B4 
-.047B5  -  .089B6  +  .015B7  -  .031B8  -  .164B9 
-.036B10  +  .081B11  +  .339B12  and  a*  “  * 00136  * 

A-  A 

For  illustrative  purposes  $2(B)  and  ^(B)  were  fit  by  the  Yule-Walker 

method,  although  MLEs  may  be  desirable  for  a  more  refined  model. 

Approximate  standard  errors  for  the  coefficients  in  these  operators 

are  given  in  Table  4.  (The  standard  errors  were  found  under  the 

2 

assumption  that  Z^Nfa.a  )  by  using  the  approach  of  Box  and  Jenkins 
(1976).) 

It  is  clear  that  many  other  reasonable  models  could  be  obtained  by 

using  different  operators  of  the  form  (1  -  $B)  (B)  where  $12(B)  13  some 

12 

12th  order  operator  similar  to  1  -  B  .  Models  (5)  and  (6)  certainly  seem 

adequate,  however,  and  they  illustrate  well  the  method  for  modeling 

seasonal  data  through  S-arrays.  Mote  that  model  (6)  has  two  roots  of 

unity  and  will  thus  have  a  forecast  function  possessing  a  linear  trend; 

whereas  the  forecast  function  for  model  (5)  will  not  contain  the  linear 

trend  for  forecasts  at  long  lead  times  since.  Instead  of  two  units  roots, 

rhla  model  has  one  unit  root,  one  root  equal  to  1.345,  and  an  additional 

low  frequency  component.  In  general,  this  kind  of  a  difference  between 

the  forecast  functions  of  stationary  and  nonstatiouary  models  can  provide 

a  means  of  choosing  between  competing  models. 

A  comparison  of  forecasts  obtained  from  models  (4)  and  (5)  may  be  seen 

in  Fig.  2-4.  The. forecasts  are  made  from  origins  24  months,  36  months,  and 

k  a  2 

48  months  prior  to  the  end  of  the  data  set.  SSE  (i.e.  £  (xc  +i-xfc  +i)  ) 

i-1  o  o 

for  model  (4),  the  Box-Jenkins  model,  is  seen  to  be  more  erratic  than  that 
of  model  (5)  in  the  sense  that  it  does  not  increase  monotonically  as  the 
— riw  number  of  steps  ahead  to  be  forecast  increases.  The  reason  for  this 


appears  to  be  that  the  trend  component  of  the  forecast  function  for  model 


Model  (5)  SSE  -  .0203 
Figure  2 
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(4)  is  determined  by  only  13  values  previous  to  the  forecast  origin;  whereas 
the  corresponding  component  for  model  (5)  is  determined  by  25  preceding 
values.  Since  a  close  inspection  of  the  data  reveals  evidence  of  a  24  month 
period,  it  is  understandable  why  a  forecast  function  using  only  13  months 
prior  to  the  forecast  origin  might  perform  eratically.  When  one  considers 
that  model  (5)  was  fit  through  a  purely  data  analytic  approach  it  is  not 
surprising  that  its  forecast  function  contains  the  low  frequency  component 
not  found  in  model  (4) . 

For  an  interesting  and  more  complete  discussion  of  forecasts  obtained 
from  (4) ,  (5) ,  and  a  model  proposed  by  Farzen  see  Gray  and  Woodward  (1980) . 

IV.  Conclusion 

A  method  of  modeling  nonstationary  AKMA  processes  (with  special  emphasis 
on  seasonal  processes)  has  been  examined  in  this  paper.  A  brief  outline  of 
this  method  is  as  follows: 

(i)  Detect  the  presence  of  nonstationary  factors  by  examining 
the  sample  S-array. 

(ii)  Determine  the  nature  of  the  detected  nonstationary  factors 
by  fitting  an  appropriate  Yule-Walker  model  to  the  data. 

(141)  Transform  the  data  by  the  operator  obtained  from  the  Yule- 
Walker  fit  or  an  adjusted  operator  and  examine  the  S-array 
of  the  transformed  data  for  the  presence  of  additional 
stationary  or  nonstationary  factors. 

(lv)  After  identifying  the  full  order  of  the  model,  decide  which 
factors  are  to  be  treated  as  nonstationary.  Transform  the 


data  by  the  nonstationary  factors  and  fit  a  model  of 
appropriate  order  to  the  transformed  data. 

Using  this  methodology,  it  has  been  shown  how  models  for  the  international 
airline  data  may  be  obtained  through  a  data  analytic  technique  rather  than  by 
simply  considering  the  physical  aspects  of  the  data. 
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